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I  SUMMARY 


In  this  research  project  the  meaning  of  image  matching  in  the  symbolic 
sense  was  explored  in  detail  as  well  as  the  nature  of  the  algorithms  required 
to  do  image  matching.  It  was  determined  that  image  matching  was,  in  fact, 
a  relational  homomorphism  problem  and  that  relational  homomorphism  was  a 
kind  of  constraint  satisfaction  problem  that  is  called  consistent  labeling. 

Consistent  labeling  problems  are  prevalent  in  image  understanding  work: 
for  example,  line  labeling,  and  region  interpretation  are  consistent  labeling 
probelms.  Having  realized  that  the  hard  computational  problems  were  consistent 
labeling  problems,  our  efforts  were  concentrated  on  efficient  algorithms  for 
solving  them. 

Most  of  our  wojrk  generalized  the  relaxation  operators  of  Ullman  and 
Waltz  and  showed  how  what  they  produced  related  to  either  homomorphisms 
or  consistent  labeling  problems.  After  we  finished  these  theoretical 
generalizations,  we  began  empirical  studies  measuring  the  improvement  of 
the  relaxation  algorithms  over  standard  backtracking. 

Although  due  to  AFOSR  administrative  rules,  the  grant  was  not  continued 
at  VPI  after  I  left  Kansas,  we  nevertheless  continued  our  research  and 
discovered  an  algorithm  that  was  not  of  the  relaxation  type  and  which  gave 
considerable  improvement  over  them. 

The  beginning  part  of  this  report  briefly  describes  the  papers  which 
have  been  published  as  a  result  of  this  research.  The  main  part  of  this 
report  describes  the  results  of  the  research  done  at  VPI. 
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II  Papers  Published 

One  important  process  in  many  artificial  intelligence  problems  is  that 
of  making  empirical  knowledge  (coming  from  data)  about  the  world  consistent 
with  theoretical  or  model  knowledge  we  have  about  the  world.  Constraint 
relaxation  techniques  have  been  used  with  some  success  in  this  regard.  In 
a  paper  entitled  "Scene  Analysis,  Arrangements,  and  Homomorphisms"  presented 
at  the  1977  Workshop  on  Scene  Analysis  at  Amherst,  Massachusetts,  and 
subsequently  published  as  a  chapter  in  the  book  Machine  Vision  (Hanson  and 
Riseman,  Ed.,  1978),  we  illustrated  how  the  constraint  relaxation  as  applied 
to  scene  analysis  problems  is  really  a  technique  for  finding  homomorphisms 
from  one  arrangement  (a  set  of  labeled  relations)  to  another.  We  showed 
how  a  few  common  scene  analysis  problems  are  really  problems  in  establishing 
the  relationship  between  one  arrangement  and  another. 

The  idea  of  homomorphism  between  arrangements  and  a  discrete  relaxation 
algorithm  for  finding  them  was  discussed  in  detail  in  the  paper  "Arrangements, 
Homomorphisms,  and  Discrete  Relaxation"  (with  J.  Karten)  that  appeared  in  the 
August  1978  issue  of  the  IEEE  Transactions  on  Systems,  Man,  and  Cybernetics. 

In  a  paper  entitled  "Structural  Pattern  Recognition,  Arrangements, 
and  Theory  of  Covers"  presented  at  the  1977  Pattern  Recognition  and  Image 
Processing  Conference,  we  illustrated  how  the  concept  of  arrangements  can 
be  used  in  structural  pattern  recognition  as  an  alternative  to  syntactic 
techniques.  This  paper  was  substantially  expanded,  illustrating  how  syntactic 
techniques  depend  on  homomorphisms,  and  published  in  the  June  1978  issue  of 
Pattern  Recognition.  The  paper  was  entitled  "Structural  Pattern  Recognition, 
Homomorphism,  and  Arrangements". 


Because  relational  homomorphism  seemed  to  be  playing  such  a  large 
role  in  structural  pattern  recognition,  we  began  to  examine  algorithms  for 
computing  relational  homomorphisms.  This  lead  to  posing  a  more  general  type 


of  constraint  satisfaction  problem  and  using  more  generali-ed  discrete 
relaxation  to  make  the  tree  search  be  more  efficient.  A  paper  entitled 
"Reduction  Operations  for  Constraint  Satisfaction"  coauthored  with  Davis, 
Rosenfeld,  and  Milgram  was  published  in  the  April  1978  issue  of  Information 
Sciences . 

Further  generalizing  this  basically  combinatorial  problem  and  the 
operations  to  make  the  tree  search  more  efficient,  we  began  work  on  the 
consistent  labeling  problem.  A  talk  was  presented  at  the  1978  Princeton 
Workshop  on  Artificial  Intelligence  and  Pattern  Recognition  and  Image  Processing 
The  entire  scope  of  this  work  was  prepared  with  Shapiro  as  a  two  part  paper 
entitled  "The  Consistant  Labeling  Problem"  part  one  of  which  appeared  in  the 
March  1975  issue  of  the  IEEE  Transactions  on  Pattern  Analysis  and  Machine 
Intelligence  and  part  two  of  which  will  appear  in  the  May  1980  issue. 

At  VPI  work  continued  on  improving  the  efficiency  of  the  algorithm  for 
determining  consistent  labelings.  The  paper  "Increasing  Tree  Search  Efficiency 
for  Constraint  Satisfaction  Problems"  (with  G.  Elliott)  presented  at  the  1979 
International  Joint  Conference  on  Artificial  Intelligence  in  Tokyo  demonstrated 
a  new  algorithm  called  "forward  checking"  whose  performance  surpasses  that  of 
the  earlier  relaxation  algorithms.  The  paper 
discusses  the  details  of  this  algorithm. 
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ABSTRACT 


In  this  paper  we  explore  the  number  of  tree  search  operations 
required  to  solve  binary  constraint  satisfaction  problems.  We  show 
analytically  and  experimentally  that  the  two  principles  of  first 
trying  the  places  most  likely  to  fail  and  remembering  what  has  been 
done  to  avoid  repeating  the  same  mistake  twice  improve  the  standard 
backtracking  search.  We  experimentally  show  that  a  lookahead  procedure 
called  forward  checking  (to  anticipate  the  future)  which  employs  the 
most  likely  to  fail  principle  performs  better  than  standard  backtracking, 
Ullman's  Waltz's,  Mackworth's,  and  Haralick's  discrete  relaxation  in  all 
cases  tested,  and  better  than  Gaschnigs  backmarking  in  the  larger 
problems. 


I.  INTRODUCTION 


1  1 


Associated  with  search  procedures  are  heuristics.  In  this  paper 
we  provide  a  theory  which  explains  why  two  heuristics  used  in  constraint 
satisfaction  searches  work.  The  heuristics  we  discuss  can  be  given 
a  variety  of  one  line  descriptions  such  as: 

Lookahead  and  anticipate  the  future  in  order  to  succeed  in  the  present. 
To  succeed,  try  first  where  you  are  most  likely  to  fail. 

Remember  what  you  have  done  to  avoid  repeating  the  same  mistake. 
Lookahead  to  the  future  in  order  not  to  worry  about  the  past. 

We  will  attempt  to  show  that  for  a  suitably  defined  random  constraint 

satisfaction  problem,  the  average  number  of  tree  search  operations 

* 

which  employs  these  principles  will  be  smaller  than  that  required  by 
the  standard  backtracking  tree  search. 

To  begin  our  discussion,  we  need  a  precise  description  of  the 
constraint  satisfaction  problem  we  are  attempting  to  solve  by  a  search 
procedure.  We  assume  that  there  are  N  units  (some  authors  call  these 
variables  instead  of  units) .  Each  unit  has  a  set  of  M  possible  values 
or  labels.  The  constraint  satisfaction  problem  we  consider  is  to 
determine  all  possible  assignments  f  of  labels  to  units  such  that  for 
every  pair  of  units,  the  corresponding  label  assignments  satisfy  the 
constraints.  More  formally,  if  U  is  the  set  of  units  and  L  is  the 
sec  of  labels,  then  the  binary  constraint  R  can  be  represented  as  a 
binary  relation  on  U  x  L:  R  S  (U  x  L)  x  (U  x  L) .  If  a  pair  of  unit- 
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labels  (u...  ,Z ,u9 ,  £0)  £  R,  then  labels  Z.  and  <L  are  said  to  be 
consistent  or  compatible  for  units  and  u9.  A  labelling  f  of  all  the 
units  satisfies  the  constraints  if  for  every  pair  tt1  of  units 
(u^,  f  (u.^)  ,u.^  ,  f  (u^) )  £  R.  Haralick  et  al.  (1978)  call  such  a  labeling 
a  consistent  labeling. 

The  problem  of  determining  consistent  labelings  is  a  general 
form  of  many  problems  related  to  artificial  intelligence.  For  example, 
scene  labeling  and  matching  (Barrow  and  Tenebaum,  1976,  and  Rosenfeld 
et  al.,  1976),  line  interpretation  (Waltz,  1972),  edge  labeling 
(Haralick,  1978) ,  graph  homomorphisms  and  isomorphisms  (Ullman, 

1969),  graph  coloring  (Hararv,  1969),  boolean  satisfiability  (Haralick 
et  al.,  1978),  and  proposition  theorem  proving  (Kowalski,  1975)  are 
all  special  cases  of  the  general  consistent  labeling  problem. 

Ullman  (1966),  Waltz  (1972) ,»  Rosenfeld  et  al.  (1978  and  1979), 

Gaschnig  (1977,  1978  and  1979),  and  McGregor  (1979)  attempt  to 
find  efficient  methods  to  solve  the  consistent  labeling  problem. 

Knuth  (1975)  also  analyzes  the  backtracking  tree  search,  which  is 
the  basis  of  most  methods  used  to  solve  the  consistent  labeling 
problem. 

For  the  purpose  of  illustrating  the  search  required  to  solve 
this  problem,  we  choose  the  N-queens  problem,  how  to  place  N-queen 
on  an  N  x  N  checkerboard  so  that  no  queen  can  take  another.  Here,  the 
unit  set  corresponds  to  the  row  coordinates  on  a  checkerboard  and  we 
denote  them  by  positive  integers.  The  label  set  corresponds  to  the 
column  coordinates  on  a  checkerboard  and  we  denote  them  by  alphabetic 
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characters.  Hence,  the  unit-label  pair  (1,A, 2,D)  satisfies  the 
constraint  R,  [(1,A,2,D)  e  R],  since  a  queen  on  row  1  column  A 
cannot  take  a  queen  on  row  2  column  D.  But,  the  unit  label  pair 
(1,A, 3,C)  does  not  satisfy  the  constraint  R  because  queens  can 
take  each  other  diagonally  (see  figure  1). 

Using  the  number  letter  convention  for  unit-label  pairs.  Figure 


2  illustrates  a  portion  of  a  backtracking  tree  trace  for  the  6- 
queens  problem.  Notice  how  the  unit  5  labels  A,  C,  E,  and  F  occur 


twice  in  the  trace,  each  time  being  tested  and  failing  for  the  same 
reason:  incompatibility  with  units  1  or  2.  These  redundant  tests 

can  be  eliminated  if  the  fact  they  failed  can  be  remembered  or  if 

units  1  or  2  could  lookahead  and  prevent  5  from  taking  the  labels 

# 

A,  C,  E,  or  F.  The  remembering  done  by  Gaschnig's  backmarking 
(1977)  and  the  forward  checking  approach  described  in  this  paper 
help  eliminate  these  problems.  Notice  that  once  unit  3  takes  label 
E  (Figure  la)  the  only  labels  left  for  units  4  and  6  are  incompatible. 
The  forward  checking  algorithm  will  not  discover  this  future 
incompatibility.  However,  the  first  time  label  B  is  associated  with 
unit  4,  there  is  absolutely  no  label  possible  for  unit  6.  Hence, 
the  search  through  the  labels  for  5  and  6  are  entirely  superfluous 
and  forward  ckecking  will  discover  this  (Figure  lb).  The  lookahead 
procedures  (discrete  relaxation)  of  Ullman  (1966),  Waltz  (1972), 
Rosenfeld  (1976),  Mackworth  (1977),  and  Montanari  (1974)  help 
alleviate  the  problem  illustrated  in  Figure  la  as  well  as  in  Figure 
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Section  II  gives  a  description  of  the  full  and  partial  looking 
ahead,  forward  checking,  backchecking,  and  backmarking  procedures.  In 
section  III  we  compare  the  complexity  of  these  algorithms  as  they  solve 
the  N-queens  problem  and  problems  generated  randomly.  We  measure  com¬ 
plexity  ir.  terms  of  number  of  table  lookups  and  number  of  consistency 
checks.  These  results  show  that  standard  backtracking  is  least  efficient 
in  most  cases  and  bit  parallel  forward  checking  is  most  efficient  for  the 
cases  tried. 

In  section  IV,  we  give  a  statistical  analysis  of  constraint  satis¬ 
faction  searches  and  demonstrate  the  statistical  reason  why  forward 
checking  requires  fewer  expected  consistency  checks  than  standard  back¬ 
tracking.  In  section  V  we  explore  other  applications  of  the  fail  first 
or  prune  early  tree  search  strategies  and  show  that  such  particular 
strategies  as  choosing  the  next  unit  to  be  that  unit  having  fewest  labels 
left  and  testing  first  against  units  whose  labels  are  least  likely  to 
succeed  reduce  the  expected  number  of  consistency  tests  required  to  do 
the  tree  search.  Finally,  by  changing  the  unit  search  order  dynamically 
in  every  tree  branch  so  that  the  next  unit  is  always  the  one  with  fewest 
labels  left,  we  show  experimentally  that  performance  improves  for  each 
procedure  and  that  forward  checking  even  increases  its  computational 
advantage  over  the  other  algorithms. 


Figure  la  illustrates  how  the  labeling  A, C,E  for  units 
1,2,3  implies  that  the  only  labels  for  units 
4  and  6  are  incompatible  in  the  6  queens 
problem. 

Figure  lb  illustrates  how  the  labeling  A,C,E,8  for  units 
1,2, 3, 4  implies  that  there  is  no  label  for 
unit  6  in  the  6  queens  problem. 
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1  A 

2  A,  B 
2  C 

3  A,B,C,D 
3  E 
4  A 
4  B 

5  A,B,C 
5  D 

6  A,B,C,D,E,F 
5  E,  F 
4  C,D,E,F 
3  F 
4  A 
4  B 

5  A,B,C,D,E,F 
4  C,D,E,F 


Figure  2  illustrates  a  segment  of  a  tree  trace  that  the 
standard  backtracking  algorithm  produces  for 
a  6  queens  problem.  No  solutions  are  found  in 
this  segment.  The  entry  2  A,B,  for  example, 
indicates  that  labels  A  and  B  were  unsuccessful 
at  level  2,  but  2  C  succeeds  when  checked  with 
past  units,  and  the  tree  search  continues  with 
the  next  level. 


I.  SOME  PROCEDURES  FOR  TREE  SEARCH  REDUCING 


> 


? 


In  this  section  we  give  brief  descriptions  of  five  procedures, 
and  a  variation  of  data  structure  in  one,  which  can  be  used  within 
the  standard  backtracking  framework  to  reduce  tree  search  operations. 
They  are  called  full  and  partial  looking  ahead,  forward  checking, 
backchecking,  and  backmarking.  Each  of  these  procedures  invests 
resources  in  additional  consistency  tests  or  data  structures  at 
each  point  in  the  tree  search  in  order  to  save  (hopefully)  more 
consistency  tests  at  some  point  later  in  the  tree  search. 

For  ease  in  explaining  these  procedures,  we  call  those  units 
already  having  labels  assigned  to  them  the  past  units.  We  call  the 
unit  currently  being  assigned  a  label  the  current  unit  and  we  call 
units  not  yet  assigned  labels  the  future  units.  We  assume  the  existence 
of  a  unit-label  table  which  at  each  level  in  the  t«ree  search  indicates 
which  labels  are  still  possible  for  which  units.  Past  units  will  of 
course  have  only  one  label  associated  with  each  of  them.  Future 
units  will  have  more  than  one.  The  tree  search  reducing  procedures 
invest  early  to  gain  later.  Hence,  the  result  of  applying  any  of 
them  in  the  tree  search  will  be  to  decrease  the  number  of  possible 
labels  for  any  future  unit  or  reduce  the  number  of  tests  against 
past  units. 


II. 1  Looking  Ahead 

Waltz  filtering  (Waltz,  1972),  a  procedure  by  Ullman  (1966), 
discrete  relaxation  (Rosenfeld,  Hummel,  Zucker,  1976),  and  the 
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f  operator  of  Haralick  et  al.  (1978)  are  all  examples  of  algorithms 
that  look  ahead  to  make  sure  that  (1)  each  future  unit  has  at  least 
one  label  which  is  compatible  with  the  labels  currently  held  by 
the  past  and  present  units  and  (2)  each  future  unit  has  at  least 
one  label  which  is  compatible  with  one  of  the  possible  labels  for 
each  other  future  unit.  Looking  ahead  prevents  the  tree  search  from 
repeatedly  going  forward  and  then  backtracking  between  units  u  and  v, 
v  <  u,  only  to  ultimately  discover  that  the  labels  held  by  units 
1  through  v  cause  incompatibility  of  all  labels  between  some  unit 
iv,  u>  >  u,  and  some  past,  current,  or  future  unit. 

Because  looking  ahead  in  this  manner  cannot  remember  and  save 
most  of  the  results  of  tests  performed  in  the  lookahead  of  future 
units  with  future  units  for  use  in  future  lookaheads,  the  full  savings 
of  looking  ahead  are  not  realized  for  many  problems.  A  partial  look 
ahead  that  does  not  do  all  the  checks  of  full  look  ahead  will  perform 
better,  and  one  that  checks  only  future  with  present  units  (neglects 
future  with  futures)  will  do  much  better  because  all  tests  it  performs 
can  be  usefully  remembered. 

The  procedure  L_A_TREE  SEARCH  and  its  associated  subroutines 
CHECK-FORWARD  and  LOOK_FUTURE  (Figure  3  a,b,  and  c)  is  a  formal 
description  of  the  full  looking  ahead  algorithm,  which  can  easily 
be  translated  into  any  structured  recursive  language.  U  is  an  integer 
representing  the  unit,  and  will  increment  at  each  level  of  the  tree 
search.  It  takes  on  the  value  1  at  the  initial  call.  F  is  a  one 


dimension  array  indexed  by  unit,  where  entry  F(u)  for  unit  u  is 

the  label  assigned  to  u.  T  and  NEW_T  are  tables,  which  can  be  thought 


of  as  an  array  of  lists,  T(u)  is  a  list  of  labels  which  have  not 
yet  been  determined  to  be  not  possible  for  unit  a.  (We  implemented 
T  as  a  2  dimension  array,  with  the  number  of  entries  in  each  list 
(or  row)  stored  in  the  first  position  of  the  row.  This  implementation 
uses  approximately  (NUMBER_OF_UNITS) 2  x  ( NUMBER_OF_LABELS )  words 
of  memory  for  table  storage  since  there  can  be  NUMBER_OF_UNITS 
levels  of  recursion.)  The  tree  search  is  initially  called  with  T 
containing  all  labels  for  each  unit.  All  other  variables  can  be 
integers.  EMPTY_TABLE  and  NUMBER_OF_UNITS  and  NUMBER_OF_LABELS 
have  obvious  meanings. 

The  function  RELATION  ,t^)  returns  TRUE  if  (u 

e  R,  otherwise  it  returns  FALSE.  CHECK_FORWARD  checks  that  each 
future  unit  label  pair  is  consistent  with  the  present  label  F(u.) 
for  unit  u  as  it  copies  the  table  T  into  the  next  level  table 
NEW_T,  LOOK_FUTURE  then  checks  that  each  future  unit  label  pair 
in  NEW_T  is  consistent  with  at  least  one  label  for  every  other  unit, 
and  deletes  those  that  are  not. 

In  this  implementation  CHECK_FORWARD  and  LOOK_FUTURE  return 
a  flag,  EMPTY_ROW_FLAG,  if  a  unit  is  found  with  no  possible  consistent 
labels.  Thus  the  next  level  of  the  tree  search  will  not  be  called, 
otherwise  each  entry  in  NEW_T  is  consistent  with  u,  F(u),  and  therefore, 
all  the  past  unit-label  pairs. 

II. 2  Partial  Looking  Ahead 

Partial  looking  ahead  is  a  variation  of  looking  ahead  which  does 
approximately  half  of  the  consistency  checks  that  full  looking  ahead 
does  while  checking  future  with  future  units.  Each  future  unit- label 
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pair  is  checked  only  with  units  in  its  own  future,  rather  than  all 
other  future  units.  Thus  partial  looking  ahead  is  less  powerful 
than  full  looking  ahead  in  the  sense  that  it  will  not  delete  as 
many  unit-label  pairs  from  the  lists  of  potential  future  labels. 

We  will,  however,  see  that  partial  looking  ahead  does  fewer  total 
consistency  checks  than  full  looking  ahead  in  all  cases  tested. 

The  checks  of  future  with  future  units  do  not  discover  inconsistencies 
often  enough  to  justify  the  large  number  of  tests  required,  and  these 
results  cannot  be  usefully  remembered.  Since  partial  looking  ahead 
does  fewer  of  these  less  useful  tests,  it  is  more  efficient.  A  look 
ahead  that  checks  only  future  with  current  or  past  units  can  have 
better  performance  since  these  more  powerful  tests  can  also  be 
usefully  remembered. 

The  formal  algorithm  for  partial  looking  ahead  is  L_A_TRE E_S E ARCH 
(figure  3a),  with  the  call  to  LOOK_FUTURE  on  line  5  replaced  with 
an  identical  call  to  PARTIAL_LOOK_FUTURE  (figure  3d). 

II.  3  Forward  Checking 

Forward  checking  is  a  partial  lookahead  of  future  units  with 
past  and  present  units,  in  which  all  consistency  chec^i  can  be  remembered 
for  a  while.  This  method  is  similar  to  looking  ahead,  except  that 
future  units  are  not  checked  with  future  units,  and  the  checks  of 
future  units  with  past  units  are  remembered  from  checks  done  at  past 
levels  in  the  tree  search.  Forward  checking  begins  with  a  state  of 
affairs  in  which  there  is  no  future  unit  having  any  of  its  labels 
inconsistent  with  any  past  unit-label  pairs.  This  is  certainly  true 
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at  the  base  of  the  tree  search,  since  chere  are  no  past  units  with 
which  to  be  inconsistent.  Because  of  this  state  of  affairs,  to  get 
the  next  label  for  the  current  unit,  forward  checking  just  selects 
the  next  label  from  the  unit  label  table  for  the  current  unit.  That 
label  is  guaranteed  to  be  consistent  with  all  past  unit  label-pairs. 
Forward  checking  tries  to  make  a  failure  occur  as  soon  as  possible  in 
the  tree  search  by  determining  if  there  is  any  future  unit  having 
no  label  which  is  consistent  with  the  current  unit-label  pair. 

If  each  future  unit  has  consistent  labels,  it  remembers  by  copying 
all  consistent  future  unit-label  pairs  to  the  next  level's  unit 
label  table.  If  every  future  unit  has  some  label  in  the  unit  label 
table  which  is  consistent  with  the  current  unit-label  pair,  then  the 
tree  search  can  move  forward  to  the  next  unit  with  a  state  of  affairs 
similar  to  how  it  started.  If  there  is  some  future  unit  having  no 
label  in  the  unit  label  table  which  is  consistent  with  the  current 
unit-label  pair,  then  the  tree  search  remains  at  the  current  level  with 
the  current  unit  and  continues  by  selecting  the  next  label  from  the 
table.  If  there  is  no  label  then  it  backtracks  to  the  previous  unit 
and  the  previous  label  table. 

The  formal  algorithm  for  forward  checking  is  the  Procedure 
L_A_TREE_SEARCH  (Figure  3a)  with  line  5,  the  call  to  LOOK_Fl!TURE, 
removed.  Forward  checking  is  just  looking  ahead,  omitting  the  future 
with  future  checks. 

An  improvement  in  efficiency  can  be  gained  in  the  lookahead 
type  of  algorithms  by  using  a  data  structure  for  the  unit  -  label 
tables  that  is  suggested  by  McGregor  (1979).  McGregor  simultaneously 
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developed  a  weaker  form  of  the  forward  checking  algorithm  and  compared 
them  on  subgraph  isomorphism  problems.  In  L_A_TREESEARCH  and  CHECK_FORWARD 
the  lists  T(u)  or  NEW_T(u)  can  be  stored  as  bit  vectors,  with  one  bit  in 
the  machine  word  for  each  possible  label  (this  is  essentially  a  set 
representation).  Lines  4,  5  and  6  in  CHECK_FORWARD  (figure  3b)  can  then 
be  replaced  with  the  following  statement: 

NEW_T(U2)  -  ANP(  T(U2),  RELATI0N_BIT_VECT0R(U,L,U2)  ) 

This  single  statement  replaces  a  loop,  taking  advantage  of  the  parallel 
bit  handling  capabilities  of  most  computers.  RELATION_BIT_VECTOR 
returns  a  bit  vector  with  a  bit  on  in  each  position  corresponding  to  a 
label  tl  for  which  RELATION  [u.,£,il2  ,£2 )  would  have  been  true.  This  is 
essentially  the  set  {^l  {11,1,11^,1^)  e  R}.  Thus  NEW_T  (u.2)  becomes  the 
set  {^2  £  TCc^)  |  (.(*■,£, u2,£-2^  £  »  precisely  what  lines  4,  5,  and  6  do. 

If  the  number  of  labels  is  l<?ss  than  the  word  length  of  the  computer  used, 

then  the  relation  can  be  directly  stored  in  an  array  of  size  (NUMBER_OF_ 

2* 

UNITS)  x  (NUI1BER_0F_LABELS)  ,  and  the  tables  T  and  NEW_T  will  take 

2 

approximately  (NUMBER_OF__UNITS)  words  of  storage.  Reduced  forms  of  the 
relation  exist  for  some  problems,  such  as  the  N-queens  problem  or  the 
subgraph  isomorphism  problem,  in  which  only  two  dimensional  tables  need 
be  stored  and  a  quick  calculation  will  generate  the  needed  list.  The 
same  technique  can  be  applied  to  the  full  and  partial  looking  ahead 
algorithms,  but  they  will  not  be  compared  here  since  the  three  algorithms 
will  have  approximately  the  same  relationships  of  efficiency,  to  each 
other,  with  or  without  the  improved  data  structure. 


11. 4  Backchecking 

Backchecking  is  similar  to  forward  checking  in  the  way  it 
remembers  unit  label  pairs  which  are  known  to  be  inconsistent  with 
the  current  or  any  previous  unit  label.  However,  it  keeps  track  of 
them  by  testing  the  current  unit  label  only  with  past  unit  label 
pairs  and  not  future  ones.  So  if,  for  instance,  labels  A,  B,  and  C 
for  unit  5  were  tested  and  found  incompatible  with  label  B  for  unit 
2,  then  the  next  time  unit  5  must  choose  a  label,  it  should  never 
have  A,  B,  or  C  as  label  possibilities  as  long  as  unit  2  still  has 
the  label  B. 

Each  test  that  backchecking  performs  while  looking  back  from 
the  current  unit  u  to  some  past  unit  v,  forward  checking  will  have 
performed  at  the  time  unit  v  was  the  current  unit.  Of  course, 
at  that  time,  forward  checking  will  also  have  checked  all  future 
units  beyond  unit  < x.  Hence,  backchecking  performs  fewer  consistency 
tests,  an  advantage.  But  backchecking  pays  the  price  of  having 
more  backtracking  and  at  least  as  large  a  tree  as  forward  checking. 
Backchecking  by  itself  is  not  as  good  as  forward  checking. 

11. 5  Backmarking 

Backmarking  (defined  in  Gaschnig,  1977,  and  also  discussed  in 
Gaschnig,  1978)  is  backchecking  with  an  added  feature.  Backchecking 
eliminates  performing  some  consistency  checks  that  were  previously 
done,  had  not  succeeded,  and  if  done  again  would  again  not  succeed. 


Backmarking  also  eliminates  performing  some  consistency  checks  that 
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were  previously  done,  had  succeeded,  and  if  done  again  would  again 
succeed.  To  understand  how  backmarking  works,  recall  that  the  tree 
search  by  its  very  nature  goes  forward,  then  backtracks,  and  goes 
forward  again.  We  focus  our  attention  on  the  current  unit  u.  We  let 
v  be  the  lowest  ordered  unit  to  which  we  have  backtracked  (has  changed 
its  label)  since  the  last  visit  to  the  current  unit  u.  Backmarking 
remembers  v.  If  v  =  u,  then  backmarking  proceeds  as  backchecking. 

If  v  <  it,  then  since  all  the  labels  for  unit  u.  had  been  tested  in 
the  last  visit  to  unit  u,  any  label  now  needing  testing,  needs  only 
to  be  tested  against  the  labels  for  units  v  to  tt-1,  which  are  the 
ones  whose  labels  have  changed  since  the  last  visit  to  unit  u. 

That  is,  the  tests  done  previously  against  the  labels  for  units  1 
through  v-1  were  successful  and  if  done  again  would  again  be 
successful  because  labels  for  units  1  thorugh  v-1  have  not  changed 
and  the  only  labels  permitted  for  the  current  unit  u  are  those  which 
have  passed  the  earlier  tests  (see  figure  4). 

The  formal  algorithm  for  Backmarking  appears  in  figure  5. 

It  is  essentially  Gaschnig's  algorithm  (in  Gaschnig,  1977),  but 
modified  to  find  all  solutions.  The  variable  U  and  array  F  are  the 
same  as  in  looking  ahead.  LOWUNIT  is  a  one  dimensional  array  of 
NUMBER_OF_UNITS  entries,  and  LOWUNIT(i)  will  indicate  the  lowest 
level  at  which  a  change  of  label  has  occurred  since  the  last  time 
the  MARK  array  is  set.  MARK  is  dimensioned  NUMBER_OF_UNITS  by 
NUMBER_OF_LABELS ,  and  MARK(u,£)  will  indicate  the  lowest  level  at 
which  a  consistency  test  failed  when  the  unit  label  pair  (u,£) 
at  the  current  level  was  last  tested  against  the  previous  unit 
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label  pairs  on  previous  levels.  Ac  any  point  if  MARK(u,£)  is  less 
Chan  LOWUNIT(u)  Chen  the  algorithm  knows  that  (u.,Z)  has  already 
been  tested  against  the  unit  -  label  pairs  at  levels  below  the  value 
in  LOWUNIT(u)  and  will  fail  at  level  MARK(u.,.£),  so  there  is  no 
need  to  repeat  the  tests.  If  MARK(u,£)  is  greater  or  equal  to 
LOWUNIT(u)  then  all  tests  will  succeed  below  the  level  LOWUNIT(u) 
and  only  tests  against  units  at  LOWUNIT(u)  to  the  current  unit 
need  be  tested. 

Before  the  initial  call  to  BACKMARK,  all  entries  in  LOWUNIT 
and  MARK  are  initialized  to  1,  and  BACKMARK  is  called  with  the  initial 
u.  -  1.  Since  the  same  MARK  and  LOWUNIT  arrays  are  used  at  all  levels 
of  recursion  of  the  tree  search,  approximately  (NUMBER_OF_UNITS)  x 
(NUMBER_OF_LABELS)  words  of  table  storage  are  needed. 
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1.  RECURSIVE  PROCEDURE  L_A_TREE_SEARCH(U,F,T) ; 

2.  FOR  F(U)  =  each  element  of  T(U)  BEGIN 

3.  IF  U  <  NUMBER_OF_UNITS  THEN  BEGIN 

4 .  NEW_T  =  CHECK_FORWARD ( U , F ( U ) , T ) ; 

5.  CALL  LOOK_FUTURE(U,NEW_T) ; 

6.  IF  NEW_T  is  not  EMPTY_ROW_FLAG  THEN 

7.  CALL  L_A_TREE_SEARCH(U+1,F,NEW_T) ; 

8.  END; 

9 .  ELSE 

10.  Output  the  labeling  F; 

11.  END; 

12.  END  L_A_TREE_SEARCH; 

Figure  3a 

1.  PROCEDURE  CHECK_FORWARD (U , L , T) ; 

2.  NEW_T  =  empty  table; 

3.  FOR  U2  =  U+l  TO  NUMBER_OF_UNITS  BEGIN 

4.  FOR  L2  ■  each  element  of  T(U2) 

5.  IF  RELATION (U ,L,U2 ,L2)  THEN 

6.  Enter  L2  into  the  list  NEW_T(U2); 

7.  IF  NEW_T(U2)  is  empty  THEN 

8.  RETURN  (EMPTY_ROW_FLAG) ;  /*  No  consistent  labels  */ 

9.  END; 

10.  RETURN  (NEW_T); 

11.  END  CHECK_F0RWARD ; 

Figure  3b 

Figures  3a,  b,  and  c  express  the  full  looking  ahead  algorithm. 

Replace  the  call  to  L00K_FUTURE  at  line  5  in  figure  3a 
with  an  identical  call  to  PARTIAL_LOOK_FUTURE  (figure  3d) 
and  the  partial  looking  ahead  algorithm  is  obtained. 

Forward  checking  consists  of  figure  3a  and  3b  with  line  5 
of  figure  3a,  the  call  to  L00K_FUTURE ,  deleted  so  that  only 
CHECK_FORWARD  is  called.  Forward  checking  does  no  checks 
of  future  units  with  future  units. 
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1.  PROCEDURE  LOOK_FUTURE  (U ,  NEW__T)  ; 

2.  IF  U+l  ^  NUMBER_OF_UNITS  THEN  RETURN; 

3.  FOR  U1  =  U+l  TO  NUMB  ER_0  F_UN ITS  BEGIN 

4.  FOR  LI  =  each  element  of  NEW_T(U1) 

5.  FOR  U2  =  U+l  TO  NUMBER_OF_UNITS  except  skipping  U1  BEGIN 

6.  FOR  L2  =  each  element  of  NEW__T(U2) 

7.  IF  RELATION (Ul, LI, U2,L2)  THEN 

8.  BREAK  for  L2  loop;  /*  consistent  label  found  */ 

9.  IF  no  consistent  label  was  found  for  U2  THEN  BEGIN 

10.  Delete  LI  from  list  NEW_T(U1); 

11.  BREAK  for  U2  loop;  /*  unit  U2  has  no  label 

consistent  with  U1,L1  */ 

12.  END; 

13.  END  for  U2  loop; 

14.  END  for  LI  loop; 

15.  IF  NEW_T(U1)  is  empty  THEN  BEGIN 

16.  NEWJT  =  EMPTY_ROW_FLAG; 

17.  RETURN; 

18.  END; 

19.  END  for  Ul  loop; 

20.  RETURN; 

21.  END  LOOK_FUTURE; 


Figure  3c 


Figure  3c  is  the  L00K_FUTURE  procedure,  which  deletes  future 
labels  which  are  not  consistent  with  at  least  one 
label  for  every  unit  other  than  the  label's  own  unit. 


1.  PROCEDURE  PARTIAL_LOOK_FUTURE(U,NEW_T) ; 

2.  IF  U+l  >,  NUMBER_OF_UN IT  S  THEN  RETURN ; 

3.  FOR  U1  -  U+l  TO  NUMBER_OF_UNITS  -  1  BEGIN 

4.  FOR  LI  =  each  element  of  NEW_T(U1) 

5.  FOR  U2  =  Ul+1  TO  NUMBER_OF_UNITS  BEGIN 

6.  FOR  L2  =  each  element  of  NEW_T(U2) 

7.  IF  RELATION (Ul, LI, U2,U2)  THEN 

8.  BREAK  for  L2  loop;  /*  consistent  label  found  */ 

9.  IF  no  consistent  label  was  found  for  U2  THEN  BEGIN 

10.  Delete  LI  from  list  NEW_T(U1); 

11.  BREAK  for  U2  loop;  /*  unit  U2  has  no  label 

consistent  with  U1,L1  */ 

12.  END ; 

13.  END  for  U2  loop; 

14.  END  for  LI  loop; 

15.  IF  NEW_T  (Ul)  is  empty  THEN  BEGIN 

16.  NEW_T  =  EMPTY_ROW_FLAG; 

17.  RETURN ; 

18.  END; 

19.  END  for  Ul  loop; 

20.  RETURN; 

21.  END  PARTIAL_LOOK_FUTURE ; 

Figure  3d 

Figure  3d  is  the  PARTIAL_LOOK_FUTURE  procedure.  It  differs  from 

L00K_FUTURE  (figure  3c)  only  at  lines  1,  3,  and  5.  Each 
future  unit-label  pair  is  checked  only  with  units  in  its 
own  future. 


NUMBER  OF  CONSISTENCY  CHECKS 


UNIT  NUMBER 


Figure  4  illustrates  a  segment  of  a  tree  trace.  Because 
backmarking  remembers  from  the  first  visit  to 
unit  u  which  labels  for  u  were  compatible  with 
the  labels  for  units  1  through  v-1  (segment  A)  do 
not  have  to  be  performed.  Only  those  for  units 
v  through  u-1  (segment  B)  have  to  be  performed. 
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1.  RECURSIVE  PROCEDURE  BACKMARK (U,F, MARK, LOWUNIT) ; 

2.  FOR  F(U)  =  1  TO  NUMBER_OF_LABELS  BEGIN 

3.  IF  MARK(U,F(U) )  ^  LOWUNIT (U)  THEN  BEGIN 

4.  TESTFLAG  =  TRUE ; 

5.  I  *  LOWUNIT (U) ; 

6.  WHILE  (I  <  U)  BEGIN  /*  Find  lowest  failure  */ 

7.  TESTFLAG  =  RELATION(I,F(I) ,U,F(U) ) ; 

8.  IF  NOT  TESTFLAG  THEN  BREAK  while  loop; 

9.  1*1+1; 

10.  END  while  loop; 

11.  MARK(U,F(U))  *  I;  /*  Mark  label  with  lowest  failure  */ 

12.  IF  TESTFLAG  THEN 

13.  IF  U  <  NUMB ER_OF_UN ITS  THEN 

14.  CALL  BACKMARK (U+1,F, MARK, LOWUNIT) ; 

15.  ELSE 

16.  Output  the  labeling  F; 

17.  END ; 

18;  END  for  F  loop;  * 

19.  LOWUNIT(U)  *  U  -  1;  /*  Previous  level  will  now  change  */ 

20.  FOR  I  =  U+l  TO  NUMBER_OF_UNITS; 

21.  LOWUNIT  (I)  *  A/Ji'/(LOWUNIT(I)  ,U-1) ; 

22.  RETURN; 

23.  END  BACKMARK; 

Figure  5  is  Gaschnig's  backmarking  procedure  as  it  was  modified  to 
find  all  solutions  to  constraint  satisfaction  problems 
(see  Gaschnig,  1977). 
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III.  Experimental  Results 

In  this  section  we  compare  the  six  procedures,  partial  and 
full  looking  ahead,  backtracking,  backchecking ,  forward  checking, 
and  backmarking,  on  the  N-queens  problem  for  4  <_  N  <_  10,  and  on 
a  random  constraint  problem.  We  assume  that  the  unit  order  is 
fixed  in  its  natural  order  from  1  to  N  and  that  all  consistency 
tests  of  the  current  unit  with  past  units  or  future  units  begin 
with  the  lowest  ordered  unit.  The  label  sets  will  consist  of  all 
N  columns;  no  consideration  is  given  to  the  various  symmetries 
peculiar  to  the  N-queens  problem. 

The  random  constraint  problems  are  generated  using  a  pseudo¬ 
random  number  generator.  A  random  number  is  generated  for  each 
possible  consistency  check  for  the  relation  R,  sq  that 

each  entry  in  the  relation  will  be  made  with  probability  p.  A 
probability  of  p  *  0.65  is  chosen  so  that  problems  will  be  generated 
that  are  somewhat  similar  to  the  N-queens  problem. 

The  comparison  among  the  tree  search  reducing  procedures 
indicates  that  backtracking  is  least  efficient  in  most  cases,  and 
that  backmarking  and  forward  checking  are  more  efficient  for  the 
cases  tested.  Bit  parallel  forward  checking,  which  takes  advantage 
of  machine  parallelism,  is  the  most  efficient  for  all  cases  tried. 

Our  comparison  of  algorithm  complexity  will  be  in  terms  of  nine 
criteria  involving  number  of  consistency  tests,  number  of  table 
lookups,  and  number  of  nodes  in  the  tree  search.  There  are  a  variety 
of  ways  of  presenting  these  results  including 

1.  Number  of  consistency  tests  performed  to  obtain  all  solutions 
(figures  6  and  8). 


2.  Number  of  table  lookups  used  In  finding  all  solutions  (figures 


10  and  11). 

3.  Number  of  nodes  in  the  tree  search  to  obtain  all  solutions 
(figure  12) . 

4.  Number  of  nodes  visited  at  each  level  in  the  tree  search. 

(figure  13). 

5.  Number  of  nodes  found  to  be  consistent  at  each  level  in  the 
tree  search,  or  consistent  labelings  to  depth  (figures  17  and 
18). 

6.  Number  of  consistency  checks  at  each  level  in  the  tree  search 
(figure  19) . 

7.  Number  of  table  lookups  at  each  level  in  the  tree  search 

(figure  20). 

% 

•  • 

8.  Percentage  of  nodes  at  each  depth  that  fail  because  an  incon¬ 
sistency  was  found  at  that  depth  (figures  21  and  22). 

9.  Average  number  of  table  lookups  per  consistency  check  (figures 
23  and  24). 

Figure  6  indicates  that  the  number  of  consistency  tests 
performed  to  obtain  all  solutions  seems  to  increase  exponentially 
with  N  for  the  N-queens  problem.  The  number  of  solutions  to  the 
N-queens  probelm  also  appears  to  increase  exponentially  (see  figure 
7).  The  number  of  bit  vector  operations  is  also  shown,  for  forward 
checking  done  in  the  bit  vector  data  structure.  Though  backmarking 
appears  to  do  slightly  fewer  consistency  checks  chan  forward 
checking  on  the  N-queens  problem,  the  use  of  machine  parallelism 
gives  bit  parallel  forward  checking  a  clear  advantage  over  all  the 
other  algorithms. 

i 
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Random  constraint  problems  with  fixed  probability  .65  of  consis¬ 
tency  check  success  probability  of  0.65  are  tested  in  figures 
8  and  9.  The  number  of  consistency  tests  appears  to  grow  expon¬ 
entially  in  figure  8,  until  a  sufficiently  large  problem  size  is 
reached.  At  this  point  the  number  of  solutions  drops,  as  is 
indicated  in  figure  9,  and  the  number  of  consistency  tests  appears 
to  grow  more  slowly.  Figure  9  explains  the  unevenness  of  the 
curves  in  figure  8.  Too  few  random  relations  were  tested  for  the 
means  to  settle  closely  to  the  expected  values  for  this  type  of 
problem,  and  the  average  number  of  solutions  varies  eratically 
high  and  low. 

In  the  random  problems,  forward  checking  does  slightly  fewer 
consistency  checks  than  backmarking  in  the  larger  problem  sizes, 
and  once  again  maching  parallelism  gives  bit  parallel  forward 
checking  a  clear  advantage. 

The  number  of  table  lookups  for  the  N-queens  and  random 
relation  problems  are  compared  in  figures  10  and  11.  Only  the 
lookups  in  the  MARK  array  in  backmarking  and  backchecking,  and  the 
T  or  NEW_T  tables  in  the  lookahead  type  algorithms  are  considered. 
These  table  lookups  occur  at  line  3  in  figure  5,  backmarking, 
line  2  in  figure  3a,  line  4  in  figure  3b,  and  lines  4  and  6  in 
each  of  figures  3c  and  3d,  the  lookahead  type  algorithms.  The 
entering  of  values  into  the  tables  are  not  considered,  since  they 
always  follow  at  least  one  consistency  check,  and  never  happen 
more  often  than  consistency  checks. 
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Figure  6  compares  the  efficiency  of  the  standard  backtracking 
procedure  with  backchecking,  looking  ahead,  forward 
checking,  and  backmarking ,  for  the  N-queens  problem  in 
the  natural  unit  order.  The  number  of  bit  vector 
operations  for  bit  parallel  forward  checking  is 
also  shown. 


I 


I 


4  3  6  7  8  9  C  :l 

NUMBER  OF  UNI 73 ‘NUMBER  Or  ..ASELS'N 


Figure  3  represents  the  number  of  consistency  tests  in  the 
average  of  5  runs  of  the  indicated  programs. 

Relations  are  random  with  consistency  check  probability 
p*0.65  and  number  of  units  *  number  of  labels  *  N. 

Each  random  relation  is  tested  on  all  6  methods,  using 
the  same  5  different  relations  generated  for  each  N. 

The  number  of  bit-vector  operations  in  bit  parallel 
forward  checking  is  also  shown  for  the  same  relations. 
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Figure  9  shows  Che  average  number  of  solutions  for  the  prob¬ 
abilistic  relations  in  figure  8.  This  is  the 
average  of  5  experiments  for  each  problem  size, 
and  relations  are  random  with  consistency  check 
success  probability  p-0.65.  The  dotted  curve  is 
the  expected  number  of  solutions. 
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Figure  11  shows  Che  number  of  cable  lookups  for  the  average  of 
5  random  relacions  at  each  N,  with  number  of  units  * 
number  of  labels  ■  N,  and  probability  of  consistency 
check  success  p  =  0.65.  These  random  problems  are  the 
same  as  shown  in  figures  8  and  9. 


* igure  12  illustrates  the  number  of  nodes  in  the  treesearch  to 

find  all  solutions  to  the  N-queens  problem  for  varvir.2 

N. 
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Figure  13  shows  the  number  of  nodes  visited  at  each  depth  of 

the  treesearch  to  find  all  solutions  of  the  8-queens 
problem,  for  the  various  algorithms. 
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figure  14  shows  a  segment  of  a  tree  trace  made  by  the 
full  looking  ahead  procedure  in  a  6-queens 
problem.  One  consistent  labeling, 

IB  2D  3F  4A  5C  6E,  appears  in  this  protion  of 
the  trace.  1A  2C  fails  to  spawn  any  further 
nodes  because  the  LOOK_FUTURE  algorithm  will, 
after  deleting  several  potential  labels,  discover 
that  one  future  unit  has  no  possible  labels. 
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Figure  15  is  a  segment  of  a  tree  trace  showing  the  nodes  of  the 
tree  search  in  the  partial  looking  ahead  procedure 
in  a  6-queens  problem.  One  consistent  iabeling, 

IB  2D  3F  4A  5C  6E,  appears  in  this  portion  of  the 
trace.  1A  2C  3A,B,C,  and  D  do  not  appear  because 
CHECK_FORWARD  removes  them  from  the  table  at  nodes 
1A  and  2C.  However  1A  2C  3E  fails  to  have  successors 
because  the  only  labels  left  for  future  units  4  and  6 
are  incompatible  and  are  removed  by  LOOK_FUTURE 
(see  figure  la). 
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5  E 
4  C 


Figure  16  shows  a  segment  of  a  tree  trace  made  by  the 

forward  checking  procedure  in  a  6-queens  problem. 
One  consistent  labeling,  IB  2D  3F  4A  5C  6E,  is 
found.  Notice  that  1A  2C  3E  4B  fails  because 
the  CHECK_FORWARD  procedure  discovers  that  there 
are  no  lebels  remaining  for  unit  6  at  the  4B  node 
(see  figure  lb). 
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Backtracking  is  not  shown,  since  it  does  no  table  lookups  of 
the  type  considered.  Partial  and  full  looking  ahead  always  do 
more  table  lookups  than  forward  checking  in  these  cases,  and 
forward  checking  does  better  than  backmarking  in  the  larger 
problem  sizes.  Even  full  looking  ahead  does  fewer  table  lookups 
than  backmarking  in  the  larger  random  problems.  The  number  of 
table  lookups  into  bit  vectors  is  smaller  than  the  number  of  table 
lookups  in  other  algorithms,  when  the  bit  parallel  data  structure 
is  used  in  forward  checking. 

Figure  12  demonstrates  that  full  looking  ahead  visits  the 
fewest  nodes  in  the  tree  search,  since  it  eliminates  the  most 
potential  nodes  during  its  examination  of  future  unit-label  pairs. 
Figure  13  indicates  that  the  number  of  nodes  visited  in  the 
tree  search  is  largest  -for  the  middle  levels  in  the  tree  search, 

with  the  full  looking  ahead  procedure  having  the  fewest  nodes  at 

* 

each  level. 

Figures  14,  15,  and  16  show  segments  of  the  trace  of  nodes 
visited  by  the  full  and  partial  looking  ahead,  and  forward  checking 
algorithms  for  the  6-queens  problem.  Backmarking  and  backchecking 
will  have  the  same  node  trace  as  backtracking  (see  figure  2).  More 
detailed  trace  of  the  action  of  backmarking  can  be  found  in  Gaschnig, 
(1978  and  1979). 

Figure  17  shows  the  number  of  consistent  labelings  at  each  depth 
of  the  tree  search  for  the  8-queens  problem,  and  figure  18  shows 
the  average  number  of  consistent  labelings  for  random  problems. 

This  is  the  number  of  nodes  at  each  level  which  have  not  yet  been 
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Figure  18  illustrates  the  average  number  of  nodes  visited  at 
each  depth  of  the  treesearch  for  solutions  to  5 
random  relations  with  number  of  units  *  number  of 
labels  =  10,  and  probability  of  consistency  check 
success  of  p  *  0.65.  These  are  the  5  relations 
shown  at  the  N  -  10  case  in  figures  8,  9,  and  11. 
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Figure  19  compares  the  number  of  consistency  tests  made 
at  each  level  in  the  tree  search  for  six 
different  procedures,  for  the  3-queens  problem, 
in  the  natural  unit  order. 
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Figure  20  shows  Che  nubmer  of  table  lookups  ac  each 

possible  depth  k  in  the  treesearch  to  find  all 
solutions  to  the  8-queens  problem,  for 
the  various  algorithms. 


NODES  THAT  FAIL  DUE  TO  AN  INCONSISTENCY 


Figure  21  shows  the  percentage  of  nodes  at  a  given  depth  in 

the  tree  search  which  fail  because  some  inconsistency 
is  detected  at  that  node.  Results  are  shown  for 
the  various  algorithms  in  the  8-queens  problem. 
Backchecking  and  backmarking  often  discover  that 
there  is  an  inconsistency  by  using  table  lookups, 
rather  than  performing  all  the  checks.  Nodes 
in  the  lookahead  type  algorithms  fail  because  a 
future  unit  fails  to  have  any  remaining  labels 
after  inconsistent  future  labels  are  removed. 
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Figure  22  shows  Che  percentage  of  nodes  ac  a  given  depth 

in  the  tree  search  which  fail  because  some  incon¬ 
sistency  is  detected  at  that  node.  Results  are 
shown  for  the  average  over  5  random  relations, 
with  consistency  check  probability  p*0.65  and 
number  of  units  ■  number  of  labels  -  10. 
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found  Co  be  inconsistent.  Back.mark.ing  and  backchecking  will  have 
the  same  search  tree  as  backtracking,  and  consequently  has  the  same 
number  of  nodes  and  consistent  labelings  at  each  depth  (see  figures 
13,  17,  and  18).  Their  efficiencies  are  gained  by  reducing  the 
ammount  of  work  spent  at  each  node,  checking  against  past  units. 
However,  the  lookahead  algorithms  perform  extra  work  at  each  node 
to  reduce  the  number  of  nodes,  and  as  figures  19  and  20  show,  the 
relation  checks  and  table  lookups  for  the  lookahead  type  algorithms 
are  concentrated  more  at  the  shallow  depths  of  the  treesearch. 

As  figures  6,  8,  10,  and  11  show,  full  and  partial  looking  ahead 
do  too  much  work  at  each  node  for  the  problems  shown,  and  forward 
checking  and  backmarking  do  better. 

The  percentage  of  nodes  at  each  depth  in  the  tree  search  that 
fail  because  some  inconsistency  is  discovered  are  shown  in  figures 
21  and  22.  In  the  cases  shown,  in  the  backtracking,  backchecking, 
and  backmarking  algorithms,  over  95  percent  of  the  nodes  (instantiated 
labels)  fail  at  the  deepest  level  of  the  treesearch,  because  they 
are  inconsistent  with  some  past  unit-label  pair. 

The  lookahead  type  algorithms  reduce  the  number  of  nodes  in 
the  treesearch  in  two  ways.  First  by  removing  entries  from  the 
tables  of  potential  unit-label  pairs,  and  second  by  noticing  that 
some  future  units  may  have  no  possible  labels  associated  with 
them.  In  Figures  21  and  22  all  nodes  that  fail  in  the  lookahead 
type  algorithms  do  so  for  this  second  reason,  since  they  would  not 
have  occured  as  nodes  if  their  labels  were  deleted  from  the  future 
unit-label  tables.  If  lines  7  and  8  are  removed  from  the  CHECK 
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FORWARD  procedure  in  figure  3b,  then  no  nodes  would  fail  in  forward 
checking  (this  is  McGregor's  restricted  arc  consistency  algorithm 
in  McGregor,  1979).  This  weaker  form  of  forward  checking  algorithm 
will  find  all  the  consistent  labelings  at  each  level  of  the  tree  search 
that  backtracking  does,  but  at  a  higher  cost  than  the  original  forward 
checking  algorithm.  The  replacement  of  these  lines  in  forward 
checking  will  realize  a  15  percent  saving  of  consistency  checks  and 
table  lookups  in  the  10-queens  problem,  and  over  40  percent  savings 
in  the  10  units  by  10  labels  random  problem. 

Figures  23  and  24  address  the  question  of  what  measure  best 
determines  the  algorithmic  time  complexity.  A  careful  check  of 
all  the  algorithms  will  show  that  no  step  is  executed  more  often 
than  the  maximum  of  the  number  of  consistency  checks  or  the  number 
of  table  lookups.  As  the  problem  size  increases  in  the  lookahead 
type  of  algorithms,  the  ratio  of  table  lookups  to  consistency  checks 
seems  to  decrease  from  a  maximum  of  about  2  to  no  more  than  1.5 
table  lookups  per  consistency  check  in  both  the  N-queens  and  random 
problems.  This  ratio  is  guaranteed  to  be  greater  than  or  equal  to 
one,  by  the  algorithms  structure,  thus  either  may  be  used  as  a 
measure  of  algorithmic  time  complexity  for  the  lookahead  type 
algorithms. 

Because  in  both  the  N-queens  and  random  problems  backmarking 
seems  to  have  a  steadily  increasing  ratio  of  table  lookups  to 
consistency  checks  as  problem  size  grows,  only  table  lookups  (which 
equals  the  number  of  nodes  in  the  treesearch  in  this  case)  can  be 
used  as  a  true  measure  of  algorithmic  time  complexity  for  backmarking. 
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Only  in  the  case  that  a  computation  of  a  relation  check  is  sig¬ 
nificantly  more  expensive  than  the  cost  of  a  node's  loop  control 
and  a  table  lookup  will  relation  checks  be  a  useful  practical 
measure  for  the  time  complexity  of  backmarking.  The  reason 
that  it  is  a  practical  measure  in  this  case  is  that  the  node  and 
table  lookups  cost  will  dominate  the  cost  of  execution  only  in 
very  large  probelm  sizes,  so  large  that  the  problems  can  not  be 
solved  in  a  reasonable  time,  and  relation  tests  will  dominate  the 
cost  in  the  smaller  problems  which  can  be  solved  in  a  practical 
ammount  of  time. 
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Statistical  Model  for  Constraint  Satisfaction  Searches 


Our  statistical  model  for  random  constraint  satisfaction  is 
simple.  The  probability  that  a  given  consistency  check  succeeds 
is  independent  of  the  pair  of  units  or  labels  involved  and  is 
independent  of  whatever  labels  may  already  have  been  assigned 
to  past  units.  Hence,  P((u^+^,^.k+^,u.,£)£R|£^, . . .  are 
consistent  labels  of  u^,. . . ,u^)=P( (u^+^,Z^+^,u,l)cR)  for  every 
u,l. 

The  N-queens  problem  is  a  more  difficult  problem,  with  fewer 
solutions  but  requiring  more  consistency  tests  than  the  corres¬ 
ponding  random  constraint  problem  with  the  same  probability  of 
consistency  check  success.  A  comparison  of  the  graphs  for  the  two 
problems  in  section  III  will  show  that  while  the  numerical  values 
of  the  quantities  vary  considerably,  the  basic  character  of  the 
algorithms  operation  is  similar  for  both  problems. 

In  our  analysis,  we  will  assume  that  a  given  pair  of  units 
with  a  given  pair  of  labels  is  consistent  with  probability  p, 
p  being  independent  of  which  units,  which  labels,  or  any  past 
processing.  If  each  unit  has  the  same  number,-  M,  of  possible 

labels,  then  any  k-tuple  of  labels  for  any  k  units  has  prob- 

....  k(k-l)/2  ,  .  ,  ,  , 

ability  p  of  being  consistent  since  each  labeling  must 

satisfy  k(k-l) /2  consistency  checks.  Since  there  are  M  possible 

labelings  of  k  units,  the  expected  number  of  consistent  labelings 

is 


vk  k(k-l) /2 
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The  expected  number  of  nodes  processed  at  level  k  in  a  standard 

backtracking  search  will  be  M,  the  number  of  possible  labels,  times 

the  number  of  consistent  labelings  at  the  previous  level, 

,k— 1  (k-l)(k-2)/2 


M 


Thus  there  are 

Mk  (k-1) (k-2)/2 
M  p 


tree  search  nodes  at  level  k. 

We  can  also  count  the  expected  number  of  consistency  checks 

performed  by  backtracking.  We  expect  p ^  ^  ^  2)/2  ^eve^  ^ 

nodes  and  at  each  node  a  label  must  be  tested  for  consistency  with 

the  labels  given  the  previous  k-1  units.  The  first  consistency 

check  fails  with  probability  1-p.  If  it  fails,  we  have  spent  1 

test.  If  it  succeeds  we  have  spent  1  test  and  are  committed  to 

make  another  one  which  might  also  succeed  with  probability  p.  All 

*  (k-1) 

(k-1)  tests  will  succeed  with  probability  p  .  Hence  the 
expected  number  of  consistency  checks  performed  at  each  node  is 


k-1  .  1  . 

I  ip  (1-p)  +  (k-l)p  . 

i=l 


This  may  be  simplified  by  recognizing  the  telescopic  nature  of  the 
sum  which  is  equal  to 

i 

P  • 

i=0 


But  this  is  a  geometric  sum  and  is  equal  to 


Therefore  the  expected  number  of  consistency  checks  at  level  k 

•iiv  Jt  (k-l)(k-2)/2  ,  ,  , 

will  be  M  p  ,  the  number  of  nodes  at  level  k  times 


k-1 


1-p 


,  the  expected  number  of  consistency  checks  at  a  node. 
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making 


..k  (k-1) (k-2) /2  1  -  pr 

M  p  r^r 


consistency  checks  at  level  k.  Of  cource  the  expected  total  number 
of  consistency  checks  will  be  the  summation  of  the  expected  number 
of  consistency  checks  for  each  level  k  for  k  ranging  from  1  to  N, 
the  number  of  units. 

The  computation  of  the  number  of  labelings  for  the  forward 
checking  algorithm  is  somewhat  more  complicated  because  the  algo¬ 
rithm  stops  checking  when  a  future  unit  has  no  labels  that  are 
consistent  with  the  past  and  present  unit-label  pairs.  A  consis¬ 
tent  labeling  to  depth  k  occurs  when  the  tree  search  successfully 
reaches  a  given  label  for  unit  k  and  forward  checking  of  that  unit- 
label  pair  produces  no  future  unit  that  has  no  remaining  labels. 
Thus  the  consistent  labelings  to  depth  k  for  forward  checking  meet 
exactly  the  following  conditions: 

1)  U2^2’  '**’  are  cons^scen,:  unit-label  pairs. 

2)  There  is  no  future  unit u  in  levels  k+1,  ...,  N  for  which 
there  is  no  label  t  so  that  u.,Z  is  consistent  with 


uilv  UkV 


k (k-1) / 2 


The  k  unit-label  pairs  are  consistent  with  probability  p  , 

and  there  are  M  possible  labelings  to  depth  k  (condition  1).  A 

future  unit-label  pair  is  consistent  with  the  k  past  and  present 

k 

unit-label  pairs  with  probability  p  and  there  are  M  possible 

labels  for  a  future  unit,  so  the  probability  that  a  future  unit 

has  no  label  that  is  consistent  with  the  k  past  and  present  units 
k  M 

is  (1  -  p  )  .  Since  there  are  N  units,  there  are  N-k  future  units. 
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and  che  probability  chac  all  of  these  has  at  least  one  label  that 

k  M  N-k 

is  consistent  is  [1  -  (1-p  )  ]  (condition  2).  Thus  the  expected 

number  of  consistent  labelings  to  depth  k  for  forward  checking  is 

Mk  pk(k_1>/2  ^  _  (i_pk)M]N~k. 

The  expression  for  the  expected  number  of  nodes  in  the  forward 
checking  tree  search  at  level  k  is  very  similar  to  that  for  the 
number  of  consistent  labelings  to  depth  k,  since  each  node  will 
perform  forward  checks  to  determine  if  its  label  will  become  a 
consistent  labeling.  The  labels  for  a  node  must  meet  condition  1 
above,  but  the  future  units  are  required  to  have  succeeded  with  at 
least  one  label  only  for  checks  with  the  unit  label  pairs 
u^i*  since  each  node  was  in  the  table  for  a  consis¬ 

tent  labeling  to  depth  k-1,  and  if  any  future  unit  as  seen  from 
level  k-1  failed  to«have  a  label  then  it  would  not  have  spawned 
nodes  at  the  next  level.  Thus  in  each  node  the  future  units  will 

have  at  least  one  label  and  the  second  condition  occurs  with 

k-1  M  N-k 

probability  [1  -  (1-p  )  ]  .  Thus  the  expected  number  of  nodes 

at  depth  k  in  forward  checking  is 

Mk  k(k-l) /2  k-1  M  N-k 

M  p  [1  -  (1-p  )  ] 

A  slight  overapproximation  for  the  expected  number  of  consis¬ 
tency  checks  at  depth  k  in  the  tree  search  can  be  found  by  multiplying 
the  expected  number  of  nodes  at  the  depth  times  the  expected  number 
of  labels  remaining  for  each  future  unit  times  the  number  of  future  units, 

N-k.  Since  each  future  unit  will  have  at  least  one  label,  this 

m  k~1 
M  p 


expected  number  of  labels  will  be 


1  -  (l-pk_1)M 


Thus  the  expected 
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number  of  consistency  checks  in  forward  checking  will  be 

«k+1  p(k+2) (k-1) /2  (1  .  u.pk-l)M)N-k-l  („.kK 

The  exact  expected  value  can  be  obtained  by  replacing  the  number 

of  future  units  term,  (N-k) ,  with  the  expected  number  of  future 

units  tested,  since  forward  checking  will  stop  testing  as  soon  as 

a  future  unit  is  discovered  to  have  no  possible  labels.  Each  of 

these  tests  of  a  future  unit  will  succeed  with  probability 
k  M 

1  -  (1-p  )  ,  and  reasoning  similar  to  that  for  the  number  of 
consistency  checks  at  each  node  in  backtracking  will  give 


1  -  [1  -  (l-pk)M]N~k 

M  k.M 
(1-p  ) 


for  the  expected  number  of  future  units 


tested.  Thus  the  expected  number  of  consistency  checks  at  level  k 
in  forward  checking  will  be 

P 


..k+1  (k+2)(k-l)/2r,  „  k-l.M,N-k-l  l-[l-(l-pk)V"k  . 

M  p  [  1—  (1— p  )  ]  - — ITm - - ’ 

(1-P  ) 


The  number  of  table  lookups  in  forward  checking  is  the  sum  of 
the  number  of  consistency  checks  and  the  number  of  nodes.  Thus  the 
expected  number  of  table  lookups  at  depth  k  in  the  teee  search  will 


oe 


Mkpk(k-1)/2[i_(i_pk-1)M]N-k[i  +  .  -AP^m 


i-[i-a-pk)V~k 

(l-pk)M 


l-u-p  r 

The  expected  number  of  bit  vector  operations  with  the  bit 
parallel  data  structure  in  forward  checking  can  easily  be  found,  by 
removing  the  term  for  the  number  of  labels  remaining  for  each  future 
unit  from  the  expression  for  the  number  of  consistency  tests,  since 
only  one  operation  will  be  performed  for  each  unit,  giving 


* 
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..k  k(k-l)/2  r,  ,,  k-l,M,N-k  l-[l-(l-pk)M]N"k 

M  p  ll-(l-p  )  ]  -  kM - 

Cl— p  )‘ 

for  the  expected  number  of  bit  vector  operations  at  level  k  in 
bit  parallel  forward  checking. 

The  number  of  table  lookups  in  bit  vectors  is  still  the  sum  of 
the  number  of  bit  vector  operations  and  the  number  of  nodes,  for 
„k  pk(k-«/2  ,,.,^-1^.-1  u  * 

(1-p  ) 

table  lookups  into  bit  vectors  in  bit  parallel  forward  checking. 

To  illustrate  the  general  form  of  the  expressions  we  computed 
for  the  expected  number  of  consistency  checks  and  expected  number 
of  solutions,  we  present  a  few  graphs.  Figure  25  illustrates  the 

graph  of  the  expected  number  of  consistency  checks  as  a  function 

» 

of  tree  depth  for  a  random  constraint  satisfaction  problem  having 
N  *  17  units  and  labels  and  a  probability  p  *  0.70  of  a  constraint 
being  satisfied.  Notice  that  the  greater  number  of  tests  forward 
checking  does  early  in  the  tree  search  pays  off  later  in  the  tree 
search  both  in  terms  of  number  of  consistency  tests  and  in  number 
of  successful  instantiations  at  each  tree  depth  (figure  26). 

Figure  27  illustrates  the  expected  number  of  solutions  as  a 
function  of  N  and  p  parameters  of  a  random  constraint  satisfaction 
problem.  Increasing  N  for  a  fixed  p  eventually  causes  fewer 
solutions  to  exist  because  the  number  of  constraints  is  increasing 
quadratically . 

We,  of  course,  expect  the  number  of  consistency  tests  to  increase 
as  N  increases  and  p  remains  fixed  since  the  search  space  is  becoming 
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large  very  rapidly.  This  is  shown  for  Che  forward  checking  procedure 
in  figure  28.  Also  expected  is  for  the  average  number  of  consistency 
checks  per  labeling  to  increase  as  N  increases  and  p  remains 
fixed  (figure  29).  As  N  increases,  the  problem  of  finding  the 
first  solution  as  well  as  all  solutions  is  becoming  more  and  more 
difficult.  Therefore,  it  is  not  expected  for  the  number  of  consis¬ 
tency  tests  per  solution  to  decrease  as  the  number  of  solutions 
increases.  The  reason  for  this  is  that  as  the  number  of  solutions 
increases  more  of  the  tests  required  to  verify  a  solution  become 
shared  because  the  solutions  have  common  segments.  This  is  illus¬ 
trated  in  figure  30. 

An  experimental  check  of  the  theoretical  equations  for  the 
number  of  solutions  and  number  of  solutions  at  a  given  depth  in 
the  tree  search  with  random  relations  is  given  in  figures  31  and  32. 
Although  the  average  number  of  solutions  is  close  to  the  theoretical 
result,  the  individual  relations  vary  widely.  The  I  bars  mark  a 
distance  of  one  standard  deviation  of  the  mean  above  and  below  the 
average  of  the  trials. 

Figure  33  demonstrates  the  accuracy  of  the  theoretical  expres¬ 
sion  for  the  expected  number  of  consistency  tests  with  random 
relations  in  the  forward  checking  algorithm.  The  total  expected 
number  of  consistency  tests  shown  is  calculated  from  the  sum  of 
the  expected  number  of  tests  for  each  level  in  the  various  problem 
sizes.  For  this  expression  to  be  correct,  the  expression  for  the 
expected  number  of  nodes  at  each  level  in  the  forward  checking 
algorithm  must  also  be  correct. 


EXHtCItl)  UJMUEM  OF  CONSISTENCY  TESTS  AT  GEFTH 


Figure  26  illustrates  the  number  of  consistent  labelings  as 
a  function  of  tree  depth  for  an  N  =  17,  p  *  0.70 
random  constraint  satisfaction  problem. 


Figure  29  shows  how  che  number  of  consistency  tests  ^r  solution 
increases  as  N  increases  and  p  is  held  constant,  in 
forward  checking. 


••  ""iiwi.iijimfc,. 
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DEPTH  IN  TREE  SEARCH 


Figure  32  indicates  the  number  of  consistent  labelings  to  depth 
k  in  the  tree  search  for  the  average  of  25  random 
constraint  satisfaction  problems  with  probability 
of  consistency  check  success  of  0.65  and  number  of 
unit3  -  number  of  labels  *  10.  The  dotted  curve  is 
the  theoretical  expected  number  of  labelings  for 
such  a  problem. 
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Figure  33  shows  Che  expected  number  of  consistency  checks  for 
random  relations  with  number  of  units  =  number  of 
labels  *  N,  and  consistency  check  success  probability 
p  =  0.65.  Docs  represent  the  average  number  of  consis¬ 
tency  checks  for  25  random  relations  tested  at  each 
problem  size,  and  the  I  bars  indicate  one  standard 
deviation  of  the  mean  above  and  below  the  experimental 
mean. 
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V.  The  Fail  First  Principle 

One  of  the  strategies  which  helps  tree  searching  for  con¬ 
straint  satisfaction  problems  is  the  fail  first  or  prune  early 
strategy  of  the  looking  ahead  and  forward  checking  procedures. 
There  are  other  ways  that  we  can  apply  the  general  principle  of 
trying  to  fail  first  (and  of  course  remember  that  fact  so  that 
there  are  no  unnecessarily  repeated  mistakes).  In  this  section 
we  discuss  two  other  applications  of  this  strategy.  The  first 
is  by  optimizing  the  order  in  which  we  do  consistency  tests.  The 
fail  first  principle  states  that  we  should  first  try  those  tests 
in  the  given  set  of  tests  that  are  most  likely  to  fail  since  if 
they  do  fail  we  do  not  have  to  do  the  remainder  of  the  tests  in 
the  set. 

The  second  application  is  in  dynamically  choosing  the  optimal 
order  in  which  to  process  units  in  each  branch  of  the  tree  search. 
Optimal  unit  order  choosing,  even  on  a  local  basis,  will  not  only 
lower  the  number  of  expected  consistency  tests  per  problem  as 
compared  with  a  random  ordering,  but  it  also  lowers  the  variance 
of  this  average.  For  the  unit  order  choice,  the  fail  first  prin¬ 
ciple  states  that  the  next  unit  to  choose  should  be  that  one 
with  the  fewest  possible  labels  left. 

V. 1  Optimizing  the  Consistency  Check  Order  in  Tree  Searching 

Suppose  we  are  solving  a  constraint  satisfaction  problem  and 
suppose  units  1,...,K  have  already  been  assigned  labels 

1  N 
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and  we  are  Crying  to  find  a  label  £,^+^  for  unic  K+l.  The  label 
tv,,  must  come  from  some  set  S,,,,  of  labels  and  it  must  be  con- 

sistent  with  each  of  the  previous  labels  that  is,  we 

X  K. 

must  have  (k,£^,K+l,£K+^)  e  R  for  k  =  1,...,K.  To  determine  the 

label  ,  we  sequentially  go  through  all  the  labels  in  S 

and  perform  the  K  consistency  checks:  (k,£  , K+l,£  )  e  R.  If 

iC  is.  i  X 

one  check  fails,  then  we  try  the  next  label  in  S  .  If  all 

K+X 

checks  succeed,  then  we  can  continue  the  depth  first  search  with 
the  next  unit. 

The  optimizing  problem  for  consistency  checking  is  to  deter¬ 
mine  an  order  in  which  to  perform  the  tests  which  minimizes  the 
expected  number  of  tests  performed.  To  set  up  the  optimizing 
problem,  we  must  have  some  knowledge  about  the  degree  to  which  a 
previous  unit's  label  constrains  unit  (K+l)'s  label.  For  this 
purpose  we  let  P(k)  be  the  probability  that  the  label  £  for  unit 
k  is  consistent  with  some  label  for  unit  K+l.  We  assume  that  the 

consistency  checks  are  independent  events  so  that  the  probability 

K 

of  the  tests  succeeding  on  units  1  through  K  is  II  P(k). 

k=l 

For  each  order  of  testing,  these  probabilities  determine  the 

expected  number  of  tests  in  the  following  way.  Let  k  ,...,k  be 

X  N 

a  permutation  of  1,...,K  designating  the  order  in  which  the  con¬ 
sistency  checks  will  be  performed.  The  test  (k  ,£  ,K+1,£  )  £  R 

X  rC  ^  K  r  X 

will  succeed  with  probability  P(k^)  and  fail  with  probability 
1-P(k.,).  If  it  fails,  we  incur  a  cost  of  one  consistency  check 
and  we  try  the  next  label.  If  it  succeeds,  we  will  have  incurred 


-  65  - 


a  cost  of  one  consistency  check  and  we  are  committed  to  try  the 
next  test  O^’^k  ,K+'*',^K+1^  c  R*  T^*is  cesc  succeeds  with  prob¬ 
ability  P(k0)  and  fails  with  probability  l-P(k^).  At  this  point, 
we  have  incurred  a  cost  of  two  tests  and  may  be  committed  to  make 
more  tests  if  this  one  succeeded. 

Figure  34  shows  the  tree  of  K+l  possible  outcomes.  Since  the 
tests  are  assumed  independent,  the  probability  for  each  outcome  can 
be  computed  by  multiplying  probabilities.  For  example,  the  prob¬ 
ability  of  failing  on  consistency  check  with  unit  k^  is 
P(k^)P(k2) (l-P(k^) ) .  Also  associated  with  each  outcome  is  the 
number  of  tests  performed  to  get  there.  For  example,  failure  on 
the  test  with  unit  k^  incurs  a  cost  of  3  tests. 

The  expected  number  of  tests  C  performed  is  computed  by 

K  i-1  K 

C  =  l  i  [ 1— P (k . ) ]  H  P(k.)  +  K  ~  P(k.) 

i=l  1  j=l  3  i-1  1 

Upon  rearranging  and  simplifying  this  expression  we  obtain 

K-l  i 

C  =  1  +  y  I!  P(K.) 

i-1  j-1  3 

Now  by  the  proposition  at  the  end  of  section  V,  this  is  minimized 

by  having  k  ,  ...,k  be  any  permutation  of  1,...,K  satisfying 
X  K. 

P(k  )  <_  P(k.)  <,  ...  <.  P(k  ).  Hence,  to  minimize  expected  numbers 
of  tests,  we  must  choose  the  order  so  that  the  tests  with  units 
most  likely  to  fail  are  done  first. 

To  illustrate  the  advantage  of  using  optimum  consistency  test 
order,  we  consider  the  10-queens  problem  when  the  units  are  natur¬ 
ally  ordered  from  1  to  N  and  the  current  unit  is  K,  then  the  fail 
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first  principle  states  that  tests  with  past  units  must  be  done  in 
the  order  of  decreasing  constraints.  Since  the  row  previous  to 
row  k  has  the  strongest  constraint  on  row  k,  the  test  order  should 
be  first  unit  K-l,  then  K-2,  up  to  unit  1,  in  the  N-queens  problem. 
Backtracking  requires  1,297,488  tests  when  done  in  the  wrong  order 
(unit  1,  2,  ...,  K-l)  and  1,091,856  tests  when  done  in  the  right 
order.  It  is  interesting  to  note  that  Gaschnig's  backjumping 
procedure  (Gaschnig,  1978)  when  done  with  the  consistency  tests  in 
the  wrong  order  (1,131,942  tests)  performs  worse  than  standard 
backtracking  with  consistency  tests  in  the  right  order.  Further¬ 
more,  for  the  N-queens  problem  backjumping  with  consistency  tests 
in  the  right  order  for  the  N-queens  problem  is  equivalent  to  stan¬ 
dard  backtracking  with  consistency  tests  in  the  right  order  because 
backj  limping  bactracks  to  the  highest  level  at  which  a  failure  is 
detected,  and  there  is  always  at  least  one  lable  at  a  given  level 
which  fails  when  checked  with  the  immediately  preceding  level. 


Figure  34  illustrates  the  K+l  outcomes  of  K  tests. 


1 
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v • 2  Optimizing  Tree  Search  Order 

Every  tree  search  must  assume  some  order  for  the  units  to  be 
searched  in.  The  order  may  be  uniform  throught  the  tree  or  may 
vary  from  branch  to  branch.  It  is  clear  from  experimental  results 
that  changing  the  search  order  can  influence  the  average  efficiency 
of  the  search.  In  this  section  we  adopt  the  efficiency  criterion  of 
branch  depth  and  we  show  how  by  always  choosing  the  next  unit 
having  smallest  number  of  label  choices  we  can  minimize  the  expected 
branch  depth. 

Suppose  units  1,...,N  are  units  which  are  yet  to  be  assigned 
labels.  Let  n(m)  be  the  number  of  possible  or  available  labels  for 
unit  m.  We  assume  that  each  of  the  n(m)  label's  possible  for  unit  m 
has  the  same  probability  q  of  succeeding  and  that  success  or  failure 
of  one  of  the  labels  is  an  independent  event  from  success  or  failure 
for  any  of  the  other  labels.  Thus,  the  probability  that  a  unit  m 
will  not  have  any  label  that  succeeds  is  (1  -  q)n^m\  The  prob¬ 
ability  that  some  label  for  unit  m  succeeds  is,  therefore, 

P(m)  =  1  -  (1  -  q)n^m^.  Unfortunately,  this  analysis  holds  only 
for  the  first  level  of  the  tree. 

Let  k, ,...,k„  be  the  order  in  which  the  units  are  searched  on 
1  N 

the  tree.  Let  P  (k  |k, ,...,k  . )  be  the  conditional  probability 

n  n  1  n-1 

that  some  label  for  unit  k  will  succeed  when  unit  k  is  the  nCh 

n  n 

one  in  the  tree  search  order  given  that  units  k. ,...,k  ,  are  the 

1  n-l 

first  n-l  units  searched  in  the  branch.  We  assume  that  the  prob¬ 
ability  of  a  label  for  unit  k  succeeding  depends  only  on  the  number 


of  units  preceding  it  in  the  tree  search  and  not  upon  which 


particular  units  they  are.  That  is, 

P  (k  |k.,...,k  )  =  P  (k  \l  ) 

n  n  1  m-1  n  n  1  n-1 

for  all  labels  This  conditional  independence 

1  n-i 

assumption  justifies  the  use  of  the  notation  P  (k  )  to  designate 

n  n 

the  probability  that  some  label  succeeds  for  unit  k^  when  it  is 
the  nC^  unit  in  the  tree  search,  and  we  will  call  the  probability 
that  an  arbitrary  label  for  unit  u  will  succeed  when  checked  against 
annother  arbitrary  unit-label  pair  the  success  probability  for  unit 


u. 

Units  which  are  searched  later  in  the  tree  typically  have 
lower  probability  for  a  label  succeeding  since  the  label  must  be 
consistent  with  the  labels  given  all  the  earlier  units.  We  want 
some  way  to  compare  the  probability  of  success  for  the  same  unit 
in  different  tree  searches.  Since  the  success  probability  "depends 
only  on  the  unit  and  its  level  in  the  tree  and  since  units  later  in 
the  tree  have  lower  success  probabilities,  we  assume  that  the 
success  probability  for  a  unit  u  when  it  is  at  level  i  in  one  tree 
search  is  related  to  the  success  probability  of  unit  u  when  it  is 

at  the  first  ievel  of  another  tree  search  by  a  constant  factor 

i-1  .  _ 

a  where  0  <  a  <.  1: 

P(u)  ai_1  =  P.(u). 

l 

The  best  search  order  is  the  one  which  minimizes  the  expected 
length  or  depth  of  any  branch.  When  the  units  are  searched  in 
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Che  order  k,  ,...,k  ,  Che  expecced  branch  depch  is  given  bv 
1  m 

N-l  n 

l  +  V  n  p. (k.). 

n=l  j=l  J  J 

By  che  proposicion  ac  Che  end  of  seccion  V,  chis  is  minimized 

when  che  unic  chosen  ac  each  level  is  chac  unic  whose  success  prob- 

abilicy  is  smallesc.  Thus  ac  level  j  we  choose  unic  k^  ,  where 

P . (k . )  <  P.(u)  for  u  #  k  ,...,k.  . 

J  1  -  J  1  J-l 

Now,  P_.(k_.)  =  ^[1  -  (1  -  qn^j^)].  Since  0  <.  q  <_  1 ,  Chis 

expression  is  minimized  by  choosing  k^  Co  be  chac  unic  having  Che 

smallesc  number  of  possible  labels. 

To  illuscrace  che  advantage  of  using  a  locally  opcimal  unic 

order  for  each  branch  in  Che  Cree  search,  we  consider  Che  improv- 

ment  achieved  on  Che  N-queens  problem  and  random  relation  problems. 

The  number  of  consistency  cests  required  is  given  in  tables  1  and 

* 

2.  Some  improvement  is  shown  in  the  larger  N-queens  problems,  and 
considerable  improvement  appears  in  the  larger  random  relation 
problems.  Figure  35  demonstrates  that  che  improvement  increases 
with  problem  size  in  the  random  relation  problems  with  p=0.65  and 
number  of  units  =  number  of  labels  =  N. 

The  reason  why  optimal  unic  order  usually  improves  forward 
checking  more  than  backmarking  is  that  forward  checking  has  more 
information  about  future  units  than  backmarking.  Therefore, 
forward  checking's  choice  of  the  next  unit  most  likely  to  fail  is 
more  likely  to  produce  a  unit  which  fails  than  backmarking' s 


choice. 
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Table  1  Number  of  Consistency  Tests  in  N-queens 
Problem  for  Normal  and  Optimal  Unit  Order 


N 

Backtracking 

Sackchecking 

Full  looking 
Ahead 

Part  looking 
Ahead 

Backmarking 

Forward 

Checking 

Normal 

Normal 

Normal  Optimal 

Normal 

Optimal 

Normal 

Optimal 

Normal 

Opciaal 

4 

34 

so 

99 

99 

97 

97 

76 

76 

76 

76 

5 

405 

356 

598 

578 

485 

431 

276 

276 

282 

282 

6 

2016 

1496 

2095 

2082 

1703 

1708 

944 

921 

964 

94  6 

: 

9297 

6042 

8942 

8941 

6511 

6318 

3236 

3168 

3338 

3229 

3 

46752 

27450 

35323 

35211 

25882 

25062 

12308 

12095 

13024 

12108 

9 

243009 

131538 

153455 

151275 

112327 

106247 

50866 

50027 

55326 

49856 

10 

1297558 

643658 

661017 

636377 

496455 

449666 

220052 

211635 

242174 

205970 

Table  2  Number  of  Consistency  Tests  in  Average  of  5  Random  Constraint 
Satisfaction  Problems  with  Consistency  Check  Success  Probability 
0.65,  for  Normal  and  Optimal  Unit  Order 


N  ■  number  of  units  "  number  of  labels 


M 

Backtracking 

3ackchecklng 

Full  looking 
Ahead 

Parc  looking 
Ahead 

Backmarking 

Forward 

Checking 

Normal 

Normal 

Normal  Optimal 

Normal  Optimal 

Normal 

Opclmal 

Normal  Optimal 

4 

243 

195 

230 

223 

184 

174 

132 

133 

133  121 

5 

1043 

731 

760 

712 

599 

554 

425 

382 

414  355 

6 

4637 

2514 

2543 

2238 

1966 

1700 

1238 

1159 

1273  970 

7 

12040 

6722 

5722 

5156 

4697 

3875 

3175 

2486 

3057  2069 

8 

25893 

13490 

10779 

9306 

9111 

7233 

6089 

a  300 

5978  3425 

9 

118086 

55318 

30799 

25904 

26788 

19174 

21170 

10246 

13616  8673 

10 

163983 

73260 

44655 

36675 

41232 

2S872 

23314 

14892 

2S238  12022 
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BACXMARKING 
/^•FORWARD  CHECKING 


Figure  35  shows  the  improvement  in  efficiency  for  the  average 

of  5  random  relation  problems  with  probability  p=0.65 
and  number  of  units  a  number  of  labels  *  N,  when 
locally  optimized  unit  order  is  used.  The  relations 
are  Che  same  as  those  shown  in  figures  9  and  10. 
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Proposition:  Let  0<a<=l  be  given.  For  each  unit  u  let  P(u)  be 


its  initial  success  probability.  Let  k^,...,k  be  any  permutation 
of  1,...,N  satisfying  P (k^) <P (k0 ) < . . . <P (k^) .  Define 
P  (u)  *  a*1  1P(u) .  Then, 


N-l  n 


N-l  n 


£  !I  P .  (k . )  <_  £  n  P.  (u.)  for  anv  permutation  u  ,...,u 

-i  J  “*1^1  J  J  ' 


Proof:  Let  u  , ...,u  be  any  permutation  of  1,...,N  minimizing 


N-l  n 


l  H  P  (u  ) .  If  u  , .  .  .  ,u 
n=l  j=l  J  J 


^  we  are  done.  If  u  ,...,u^  does  not 


equal  k  , ...,k  ,  let  m  be  the  smallest  index  such  that  u  #  k  . 
IN  mm 

Also  let  m'  be  the  index  such  that  u  #  k  .  Also  let  m'  be  the 

m  in 

index  such  that  u  ,  =  k  .  Define  the  permutation  i,,...,i  by 
mm  IN 


i  = 

u 

n 

n 

i  = 

u 

m 

ra 

i  , - 

u 

m 

m 

We  will  prove  a  contradiction  by  showing  that 

N-l  n 


N-l  n 

I  n  P . (i  . ) 

U  IT 

n=l  j=l 


<  l  n  p,(u4) 

n=l  j=l 


by  looking  at  the  products.  There  are  three  cases:  n<m,  min<m' , 
and  m'<n. 

Case  1  n<m:  Here  since  i ^  =  u  ,  j=l,...,n,  we  obtain 


n  P. (i.)  =  H  P. (u.) , 


j  =  l 


1  J 


j  =  l 


J  J 
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Case  2 


Now,  P 

u  4  k 
m 


Case  3 


N 

Hence, 

n 

of  u1? . . 


ra  <  n  <  m'  : 


a 

p  (u  ) 

n 

P  (u  ,) 

n 

j=i 

p. (i.)  =  m  m 

3  3  p  (u  )  , 

mm  j 

n  p. 

=i  j 

<v  ■ 

m  m' 

P  (u  ) 
m  m 

n  p. 
3-1  3 

(u.) 

3 

(u_. 

m 

}  =  pm(k  )  <  P  (u) 
mm  m 

for 

any  u 

*  . 

km-r 

Since 

j  •  •  • 

,k  ,  P  (u  , )  <  P 

m-1  mm 

(u  ) 

m  m 

and 

n 

Pffl(u  ,) 

n 

n 

j-1 

P. (i  )  =  m  “ 
j  r  pm(u  ) 

m  m 

n 

3-1 

VV 

<  r  p 

3-1  j 

(u.) 

m'  <  n: 


rr  p  (i  ) 

j*i  2  2 


P  (u  )P  ,(u  ,)  n 
m  m  m  m' 

P  (u  )P  ,  (u  ,)  n  Pi(V 
m  m  m  m'  j=l  J  J 


Pm(i  )P  . (i  ,0  n 

m  m  m  m 

P_(u  )P  ,(u  ,)  Pj(uj} 

tn  m  m  m  j=l  J  J 


Pm(um' )Pm' (u  >  n 

P  (u  )P  ,(u  ,)  Pj(uj) 

m  m  m  raf  j  =  l  J  J 


0(m  :P(u  )am  '1p(u  )  n 
_ n_ _ m_  _  . 

.-^(u  )«m'-1P(»  )  j.!  j  J 

m  m 


-1  n 

L 


2  P  (u  ) 

J-l  J  J 


N-l  n 


L  *•  P^ui^  <  l  “  P4(uj)>  contradicting  the  minimalitv 
-1  j=l  2  2  n=l  j=l  ]  2 

•»V  Therefore,  ^ . k  . 
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VI.  Conclusion 

Using  complexity  criteria  of  number  of  consistency  checks  and 
number  of  table  lookups  we  have  shown  analytically  and  experimentally 
the  efficacy  of  the  remembering  and  fail  first  principles  in  con¬ 
straint  satisfaction  tree  search  problems.  A  new  search  procedure 
called  forward  checking  has  been  described  and  it  combined  with  opti¬ 
mal  unit  order  choice  leads  to  a  more  efficient  tree  search  than 
looking  ahead  or  backmarking.  A  data  structure  that  takes  advantage 
of  a  computer's  natural  ability  to  process  bit  vectors  in  parallel 
can  make  forward  checking  even  more  efficient.  This  suggests  that 
the  entire  set  of  look  ahead  operators  described  by  Haralick  et.  al. 
(1978)  Haralick  and  Shapiro  (1979a,  1979b),  the  discrete  relaxation 
described  by  Waltz  (1972)  and  Rosenfeld  et.  al.  (1976)  would  be  more 
efficiently  implemented  by  omitting  the  consistency  tests  required 
by  future  units  against  future  units.  Further  analytic  and  experi¬ 
mental  work  needs  to  be  done  to  determine  if  this  in  fact  is  generally 
true.  Applicability  of  the  forward  checking  idea  to  inference  and 
theorem  proving  algorithms  needs  to  be  tested  and  this  will  be  the 
topic  of  a  future  paper. 
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